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Abstract 

Free probability theory was created by Dan Voiculescu around 1985, motivated 
by his efforts to understand special classes of von Neumann algebras. His dis- 
covery in 1991 that also random matrices satisfy asymptotically the freeness 
relation transformed the theory dramatically. Not only did this yield spectac- 
ular results about the structure of operator algebras, but it also brought new 
concepts and tools into the realm of random matrix theory. In the following 
we will give, mostly from the random matrix point of view, a survey on some 
of the basic ideas and results of free probability theory. 



22.1 Introduction 

Free probability theory allows one to deal with asymptotic eigenvalue distribu- 
tions in situations involving several matrices. Let us consider two sequences An 
and -Bv of selfadjoint N x N matrices such that both sequences have an asymp- 
totic eigenvalue distribution for TV — » oo. We are interested in the asymptotic 
eigenvalue distribution of the sequence /(An,Bn) for some non-trivial self- 
adjoint function /. In general, this will depend on the relation between the 
eigenspaces of An and of Bjy. However, by the concentration of measure phe- 
nomenon, we expect that for large N this relation between the eigenspaces 
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concentrates on typical or generic positions, and that then the asymptotic 
eigenvalue distribution of /(An, Bn) depends in a deterministic way only on 
the asymptotic eigenvalue distribution of An and on the asymptotic eigen- 
value distribution of Bn- Free probability theory replaces this vague notion of 
generic position by the mathematical precise concept of freeness and provides 
general tools for calculating the asymptotic distribution of /(An,Bn) out of 
the asymptotic distribution of An and the asymptotic distribution of Bn- 

22.2 The Moment Method for Several Random Ma- 
trices and the Concept of Freeness 

The empirical eigenvalue distribution of a selfadjoint N x N matrix A is the 
probability measure on R which puts mass l/N on each of the N eigenvalues 
Aj of A, counted with multiplicity. If ha is determined by its moments then it 
can be recovered from the knowledge of all traces of powers of A: 

tr(A k ) = ±{\ k 1 + ... + \ k N ) = [ t k d^ A (t), 

where by tr we denote the normalized trace on matrices (so that we have for 
the identity matrix 1 that tr(l) = 1). This is the basis of the moment method 
which tries to understand the asymptotic eigenvalue distribution of a sequence 
of matrices by the determination of the asymptotics of traces of powers. 

Definition 22.2.1 We say that a sequence (An)ng^ of N x N matrices has 
an asymptotic eigenvalue distribution if the limit lim7v->oo ^ T (A%) exists for all 

ken. 

Consider now our sequences An and Bn, each of which is assumed to have 
an asymptotic eigenvalue distribution. We want to understand, in the limit 
N — ► oo, the eigenvalue distribution of /(An,Bn), not just for one /, but for 
a wide class of different functions. By the moment method, this asks for the 
investigation of the limit N — > oo of ti(f(AN, BN) k ) for all k G N and all / 
in our considered class of functions. If we choose for the latter all polynomials 
in non-commutative variables, then it is clear that the basic objects which we 
have to understand in this approach are the asymptotic mixed moments 

hm tr(A#57---A#B£ fc ) (A; G N; n u - - - , n k , m u . . . , m k G N). 

(22.2.1) 

Thus our fundamental problem is the following. If An and Bn each have an 
asymptotic eigenvalue distribution, and if An and Bn are in generic position, 
do the asymptotic mixed moments lim/v^oo ^(A^B 7 ^ 1 ■■■A N k B r ^ l k ) exist? If 
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so, can we express them in a deterministic way in terms of 

lim tv(ip(A k N ))) and ( lim tr(B&) J . (22.2.2) 

N -*°° J km \n^oo J fcgN 

Let us start by looking on the second part of the problem, namely by trying 
to find a possible relation between the mixed moments (|22,2.ip and the moments 
(122, 2, 2D . For this we need a simple example of matrix sequences A n and Bn 
which we expect to be in generic position. 

Whereas up to now we have only talked about sequences of matrices, we 
will now go over to random matrices. Namely, it is actually not clear how 
to produce two sequences of deterministic matrices whose eigenspaces are in 
generic position. However, it is much easier to produce two such sequences of 
random matrices for which we have almost surely a generic situation. Indeed, 
consider two independent random matrix ensembles An and Bn, each with 
almost surely a limiting eigenvalue distribution, and assume that one of them, 
say -Bat, is a unitarily invariant ensemble, which means that the joint distribu- 
tion of its entries does not change under unitary conjugation. This implies that 
taking U~nBnU n , for any unitary N x iV-matrix Un, instead of Bn does not 
change anything. But then we can use this Un to rotate the eigenspaces of Bn 
against those of An into a generic position, thus for typical realizations of An 
and Bn the eigenspaces should be in a generic position. 

The simplest example of two such random matrix ensembles are two inde- 
pendent Gaussian random matrices An and Bn- In this case one can calculate 
everything concretely: in the limit N — > oo, tr(A^ B 1 ^ 1 ■ ■ ■ A N *B r ^ k ) is almost 
surely given by the number of non-crossing or planar pairings of the pattern 

A ■ A ■ ■ ■ A - B ■ B 1 ■ • • B - ■ - A ■ A- --A - B-B--- B , 

ni-times mi-times n^-times m^-times 

which do not pair A with B. (A pairing is a decomposition of the pattern into 
pairs of letters; if we connect the two elements from each pair by a line, drawn 
in the half-plane below the pattern, then non-crossing means that we can do 
this without getting crossings between lines for different pairs.) 

After some contemplation, it becomes obvious that this implies that the 
trace of a corresponding product of centered powers, 



Jim tr^A* - Jjmte^) • l) • - JjmteW) • l) • • • 

• • • M " J™ tr(A£) • 1) • (BT ~ J™ tr(S^) • l)) (22.2.3) 

M — >oo M — >oo / 

is given by the number of non-crossing pairings which do not pair A with B and 
for which, in addition, each group of ^4's and each group of B's is connected 
with some other group. It is clear that if we want to connect the groups in 
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this way we will get some crossing between the pairs, thus there are actually 
no pairings of the required form and we have that the term (|22.2.3|) is equal to 
zero. 

One might wonder what advantage is gained by trading the explicit formula 
for mixed moments of independent Gaussian random matrices for the implicit 
relation (|22.2.3|) ? The drawback to the explicit formula for mixed moments 
of independent Gaussian random matrices is that the asymptotic formula for 
tr(A^ B^ 1 ■ ■ ■ A r ^B T ^ k ) will be different for different random matrix ensembles 
(and in many cases an explicit formula fails to exist). However, the vanishing 
of (|22.2.3|) remains valid for many matrix ensembles. The vanishing of (122. 2. 3p 
gives a precise meaning to our idea that the random matrices should be in 
generic position; it constitutes Voiculescu's definition of asymptotic freeness. 

Definition 22.2.2 Two sequences of matrices (An)ngn and (Bn)ngn are asymp- 
totically free if we have the vanishing of (|22.2.3p for all k > 1 and all ni,mi, 
• • • ,n k ,m k > 1. 

Provided with this definition, the intuition that unitarily invariant random 
matrices should give rise to generic situations becomes now a rigorous theorem. 
This basic observation was proved by Voiculescu [Voi91| in 1991. 

Theorem 22.2.3 Consider N x N random matrices An and Bn such that: 
both A^ and Bn have almost surely an asymptotic eigenvalue distribution for 
N — ► oo; An and Bn are independent; Bn is a unitarily invariant ensemble. 
Then, An and Bn are almost surely asymptotically free. 

In order to prove this, one can replace A n and Bn by An and UnBnU n , 
where Un is a Haar unitary random matrix (i.e., from the ensemble of unitary 
matrices equipped with the normalized Haar measure as probability measure); 
furthermore, one can then restrict to the case where A n and Bn are deter- 
ministic matrices. In this form it reduces to showing almost sure asymptotic 
freeness between Haar unitary matrices and deterministic matrices. The proof 
of that statement proceeds then as follows. First one shows asymptotic freeness 
in the mean and then one strengthens this to almost sure convergence. 

The original proof of Voiculescu [Vbi91j for the first step reduced the asymp- 
totic freeness for Haar unitary matrices to a corresponding statement for non- 
selfadjoint Gaussian random matrices; by realizing the Haar measure on the 
group of unitary matrices as the pushforward of the Gaussian measure under 
taking the phase. The asymptotic freeness result for Gaussian random matrices 
can be derived quite directly by using the genus expansion for their traces. An- 
other more direct way to prove the averaged version of unitary freeness for Haar 
unitary matrices is due to Xu |Xu97| and relies on Weingarten type formulas 
for integrals over products of entries of Haar unitary matrices. 
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In the second step, in order to strengthen the above result to almost sure 
asymptotic freeness one can either [Voi91j invoke concentration of measure re- 
sults of Gromov and Milman (applied to the unitary group) or |Spe93 [HiaOOj 



more specific estimates for the variances of the considered sequence of random 
variables. 

Though unitary invariance is the most intuitive reason for having asymptotic 
freeness among random matrices, it is not a necessary condition. For example, 
the above theorem includes the case where Bn are Gaussian random matrices. 
If we generalize those to Wigner matrices (where the entries above the diagonal 
are i.i.d, but not necessarily Gaussian), then we loose the unitary invariance, 
but the conclusion of the above theorem still holds true. More precisely, we 
have the following theorem. 

Theorem 22.2.4 Let Xn be a selfadjoint Wigner matrix, such that the distri- 
bution of the entries is centered and has all moments, and let An be a random 
matrix which is independent from Xn- If An has almost surely an asymptotic 
eigenvalue distribution and if we have 

sup ||A/v|| < oo, 
NeN 

then An and Xn are almost surely asymptotically free. 

The case where An consists of block diagonal matrices was treated by 
Dykema [Dyk93|, for the general version see [MinlO, AndlO]. 



22.3 Basic Definitions 

The freeness relation, which holds for many random matrices asymptotically, 
was actually discovered by Voiculescu in a quite different context; namely 
canonical generators in operator algebras given in terms of free groups sat- 
isfy the same relation with respect to a canonical state, see Section 122.9.11 
Free probability theory investigates these freeness relations abstractly, inspired 
by the philosophy that freeness should be considered and treated as a kind of 
non-commutative analogue of the classical notion of independence. 

Some of the main probabilistic notions used in free probability are the fol- 
lowing. 

Notation 22.3.1 A pair (A,<p) consisting of a unital algebra A and a linear 
functional tp : A — > C with <p(l) = 1 is called a non-commutative probabil- 
ity space. Often the adjective "non-commutative" is just dropped. Elements 
from A are addressed as (non-commutative) random variables, the numbers 
y( a i(i) " " " a i{n)) f or such random variables ai, . . . , a& S A are called moments, 
the collection of all moments is called the joint distribution of ai, . . . , a&. 
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Definition 22.3.2 Let (A,<p) be a non-commutative probability space and let 
I be an index set. 

1) Let, for each i G / , A% C A, be a unital subalgebra. The subalgebras 
(Ai)i£i are called free or freely independent, if ip(a\ ■ ■ ■ a^) = whenever we 
have: k is a positive integer; aj G <Ai{j) (with £ I) for all j = 1, ...,k; 
<p(a,j) = for all j = l,...,k; and neighboring elements are from different 
subalgebras, i.e., i(l) ^ i(2),i(2) ^ i(3), . . . , i(k — 1) ^ i(k). 

2) Let, for each i G / , a, t G A. The elements (ai)i & j are called free or freely 
independent, if their generated unital subalgebras are free, i.e., if (Ai)i^i are 
free, where, for each i G I, Ai is the unital subalgebra of A which is generated 
by ai. 

Preeness, like classical independence, is a rule for calculating mixed mo- 
ments from knowledge of the moments of individual variables. Indeed, one 
can easily show by induction that if (Ai)i£i are free with respect to (p, then 
ip restricted to the algebra generated by all Ai, i £ I, is uniquely determined 
by (f\Ai f° r all i G / and by the freeness condition. For example, if A and 
B are free, then one has for a,a\,a2 G A and 6,61,62 £ 13 that (p(ab) = 
(p(a)ip(b), ip(aiba 2 ) = ip(aia 2 )(p(b), and (/?(ai6ia 2 6 2 ) = (/?(aia 2 )¥?(6i)^(6 2 ) + 
V9(ai)(^(a 2 )(^(6i6 2 ) — (/j(ai)<^(6i)9j(a 2 )9?(6 2 ). Whereas the first two factoriza- 
tions are the same as for the expectation of independent random variables, the 
last one is different, and more complicated, from the classical situation. It is 
important to note that freeness plays a similar role in the non-commutative 
world as independence plays in the classical world, but that freeness is not 
a generalization of independence: independent random variables can be free 
only in very trivial situations. Freeness is a theory for non-commuting random 
variables. 

22.4 Combinatorial Theory of Preeness 

The defining relations for freeness from Def. 122.3.21 are quite implicit and not 
easy to handle directly. It has turned out that replacing moments by other 
quantities, so-called free cumulants, is advantageous for many questions. In 
particular, freeness is much easier to describe on the level of free cumulants. 
The relation between moments and cumulants is given by summing over non- 
crossing partitions. This combinatorial theory of freeness is due to Speicher 
|Spe94| ; many consequences of this approach were worked out by Nica and 
Speicher, see [NIc06] . 

Definition 22.4.1 For a unital linear functional ip : A — > C on a unital algebra 
A we define cumulant functionals n n : A n — > C (for all n > 1) by the moment- 
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cumulant relations 



<f{a\ ■■■a, 



■n 



tr&NC{n) 



(22.4.1) 



In equation (|22.4.ip the summation is running over non- crossing partitions 
of the set {ai, 02, ■ ■ ■ , a n }; those are decompositions of that set into disjoint non- 
empty subsets, called blocks, such that there are no crossings between different 
blocks. In diagrammatic terms this means that if we draw the blocks of such a 
7r below the points a\, 0,2, ■ ■ ■ , a n , then we can do this without having crossings 
in our picture. The contribution in (|22.4.ip of such a non-crossing tt is a 
product of cumulants corresponding to the block structure of tt. For each block 
of tt we have as a factor a cumulant which contains as arguments those cij which 
are connected by that block. 

An example of a non-crossing partition tt for n = 10 is 



In this case the blocks are {ai,aio}, {02,05,09}, {03,04}, {a^}, and {07,0s}; 
and the corresponding contribution K n in (|22.4.ip is given by 



Note that in general there is only one term in (|22.4.ip involving the highest 
cumulant K n , thus the moment cumulant formulas can inductively be resolved 
for the K n in terms of the moments. More concretely, the set of non-crossing 
partitions forms a lattice with respect to refinement order and the K n are given 
by the Mobius inversion of the formula (|22.4.ip with respect to this order. 

For n = 1, we get the mean, Ki(a\) = tp{a\) and for n = 2 we have the 
covariance, ^2(01,02) = tp{a\a2) — tp(ai)ip{a,2) ■ 

The relevance of the K n in our context is given by the following characteri- 
zation of freeness. 

Theorem 22.4.2 Freeness is equivalent to the vanishing of mixed cumulants. 
More precisely, the fact that (oj)j g / are free is equivalent to: Kniflitt)-* ■ ■ ■ >°i(n)) = 
whenever n>2 and there are k, I such that i{k) ^ i(l). 

This description of freeness in terms of free cumulants is related to the 
planar approximations in random matrix theory. In a sense some aspects of 
this theory of freeness were anticipated (but mostly neglected) in the physics 
community in the paper |Cyi82j . 



Oi 02 03 04 05 06 O7 OS OgOiQ 




K n [ai 



■ ■ ■ ,aw] = K 2 {ai,ai ) ■ K 3 (a 2 ,a 5 ,a 9 ) ■ K 2 {a 3 ,a 4: ) ■ Ki(a e ) ■ K 2 {a 7 ,a s ). 
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22.5 Free Harmonic Analysis 

For a meaningful harmonic analysis one needs some positivity structure for the 
non-commutative probability space (A, ip). We will usually consider selfadjoint 
random variables and cp should be positive. Formally, a good frame for this is a 
C* -probability space, where A is a C*-algebra (i.e., a norm-closed *-subalgebra 
of the algebra of bounded operators on a Hilbert space) and ip is a state, i.e. it 
is positive in the sense <p(aa*) > for all a £ A. Concretely this means that 
our random variables can be realized as bounded operators on a Hilbert space 
and (p can be written as a vector state y>{a) = (a£, £) for some unit vector £ in 
the Hilbert space. 

In such a situation the distribution of a selfadjoint random variable a can 
be identified with a compactly supported probability measure [i a on R, via 



<p(a n ) = I t n diJL a {t) for all n £ N. 
Jr 



22.5.1 Sums of free variables: the 7^-transform 

Consider two selfadjoint random variables a and b which are free. Then, by 
freeness, the moments of a + b are uniquely determined by the moments of a 
and the moments of b. 

Notation 22.5.1 We say the distribution of a + b is the free convolution, de- 
noted by EE, of the distribution of a and the distribution of b, 

Va+b = EH Hb- 

Notation 22.5.2 For a random variable a we define its Cauchy transform G 
and its 7£-transform 1Z by 

G ^ = - + E and n ^ = E K «( a > ■ ■ ■ > a > n ~ 1 - 

n=l n=l 

One can see quite easily that the moment-cumulant relations (|22.4.1|) are equiv- 
alent to the following functional relation 

' n{G{z)) = z. (22.5.1) 



G{z 



Combined with the additivity of free cumulants under free convolution, which 
follows easily by the vanishing of mixed cumulants in free variables, this yields 
the following basic theorem of Voiculescu. 
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Theorem 22.5.3 Let G(z) be the Cauchy-transform of a, as defined in Nota- 
tion \22.57B and define its TZ-transform by the relation (|22.5,ip . Then we have 

n a+h {z) = n a {z) + n b (z) 

if a and b are free. 

We have defined the Cauchy and the 7£-transform here only as formal power 
series. Also (|22.5,ip is proved first as a relation between formal power series. 
But if a is a selfadjoint element in a C*-probability space, then G is also the 
analytic function 

G : C+ -» C-; G(z) = V ( — ) = [ J—dfi a (t); 

\z-aj J R z-t 

and one can also show that (|22,5.ip defines then TZ as an analytic function 
on a suitably chosen subset of C + . In this form Theorem 122.5.31 is amenable 
to analytic manipulations and so gives an effective algorithm for calculating 
free convolutions. This can be used to calculate the asymptotic eigenvalue 
distribution of sums of random matrices which are asymptotically free. 

Furthermore, by using analytic tools around the Cauchy transform (which 
exist for any probability measure on R) one can extend the definition of and 
most results on free convolution to all probability measures on R. See |Ber931 
IVoiOO] for more details. 

We would like to remark that the machinery of free convolution was also 
found around the same time, independently from Voiculescu and independently 
from each other, by different researchers in the context of random walks on the 
free product of groups: by Woess, by Cartwright and Soardi, and by McLaugh- 
lin; see, for example, [Woe86| . 

22.5.2 Products of free variables: the ^-transform 

Consider a, b free. Then, by freeness, the moments of ab are uniquely deter- 
mined by the moments of a and the moments of b. 

Notation 22.5.4 We say the distribution of ab is the free multiplicative con- 
volution, denoted by M, of the distribution of a and the distribution ofb, 

Note: even if we start from selfadjoint a and b, their product ab is not 
selfadjoint, unless a and b commute (which is rarely the case, when a and b are 
free). Thus the above does not define an operation on probability measures on 
R in general. However, if one of the operators, say a, is positive (and thus \x a 
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supported on then a 1//2 6a 1 / 2 makes sense; since it has the same moments 
as ab (note for this that the relevant state is a trace, as the free product of traces 
is tracial) we can identify fi a b then with the probability measure // a i/2 6a i/2. 

Again, Voiculescu introduced an analytic object which allows to deal effec- 
tively with this multiplicative free convolution. 

Theorem 22.5.5 Put M a (z) := Em=o ^(a" 1 )^" 1 and de fi ne the ^-transform 
of a by 

S a (z) : = ilfM^C*), 
where M <_1> denotes the inverse of M under composition. Then we have 

Sab(z) = S a (z) ■ S b (z) 

if a and b are free. 

As in the additive case, the moment generating series M and the S'-transform 
are not just formal power series, but analytic functions on suitably chosen do- 
mains in the complex plane. For more details, see |Ber93l iHiaOOj. 

22.5.3 The free central limit theorem 

One of the first theorems in free probability theory, proved by Voiculescu in 
1985, was the free analogue of the central limit theorem. Surprisingly, it turned 
out that the analogue of the Gaussian distribution in free probability theory is 
the semicircular distribution. 

Definition 22.5.6 Let (A,(p) be a C* -probability space. A selfadjoint element 
s £ A is called semicircular ( of variance 1) if its distribution [i s is given by the 
probability measure with density ^:V4 — t 2 on the interval [—2, +2]. Alterna- 
tively, the moments of s are given by the Catalan numbers, 

^ (s n) = I fcTT Ck ) > if n = 2k even 
1 0, if n odd 

Theorem 22.5.7 If v is a compactly supported probability measure on K with 
vanishing mean and variance 1, then 

where D a denotes the dilation of a measure by the factor a, and => means weak 
convergence. 
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By using the analytic theory of EE for all, not necessarily compactly supported, 
probability measures on M, the free central limit theorem can also be extended 
to this general situation. 

The occurrence of the semicircular distribution as limit both in Wigner's 
semicircle law as well as in the free central limit theorem was the first hint of 
a relationship between free probability theory and random matrices. The sub- 
sequent development of this connection culminated in Voiculescu's discovery of 
asymptotic freeness between large random matrices, as exemplified in Theorem 
122.2.31 When this contact was made between freeness and random matrices, 
the previously introduced TZ- and S'-transforms gave powerful new techniques 
for calculating asymptotic eigenvalue distributions of random matrices. For 
computational aspects of these techniques we refer to [Rao09 , for applications 
in electrical engineering see [Tul04j . and also Chapter 40. 
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Figure 22.1: Comparison of free probability result with histogram of eigenvalues 
of an N x N random matrix, for N = 2000: (i) histogram of the sum of inde- 
pendent Gaussian and Wishart matrices, compared with the free convolution 
of semicircular and free Poisson distribution (rate A = 1/2), calculated by us- 
ing the 7£-transform; (ii) histogram of the product of two independent Wishart 
matrices, compared with the free multiplicative convolution of two free Poisson 
distributions (both with rate A = 5), calculated by using the 5-transform 

22.5.4 Free Poisson distribution and Wishart matrices 

There exists a very rich free parallel of classical probability theory, of which the 
free central limit theorem is just the starting point. In particular, one has the 
free analogue of infinitely divisible and of stable distributions and corresponding 
limit theorems. For more details and references see [Bcr99, VoiOOj. 

Let us here only present as another instance of this theory the free Poisson 
distribution. As with the semicircle distribution, the free counterpart of the 
Poisson law, which is none other than the Marchenko-Pastur distribution, ap- 
pears very naturally as the asymptotic eigenvalue distribution of an important 
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class of random matrices, namely Wishart matrices. 

As in the classical theory the Poisson distribution can be described by a 
limit theorem. The following statement deals directly with the more general 
notion of a compound free Poisson distribution. 

Proposition 22.5.8 Let A > and v a probability measure on M with compact 
support. Then the weak limit for N — > oo of 



has free cumulants {n n )n>i which are given by n n = A • m n (y) (n > 1) (m, 
denotes here the n-th moment) and thus an IZ-transform of the form 



Definition 22.5.9 The probability measure appearing in the limit of Prop. 
\22.5.8\ is called a compound free Poisson distribution with rate A and jump 
distribution v. 

Such compound free Poisson distributions show up in the random matrix 
context as follows. Consider: rectangular Gaussian M x N random matrices 
Xm,n, where all entries are independent and identically distributed according 
to a normal distribution with mean zero and variance 1/N; and a sequence of 
deterministic N x N matrices Tv such that the limiting eigenvalue distribution 
Ht of T/v exists. Then almost surely, for M, N — > oo such that N/M — ► A, the 
limiting eigenvalue distribution of Xm,nTnX* m N exists, too, and it is given by 
a compound free Poisson distribution with rate A and jump distribution jit- 

One notes that the above frame of rectangular matrices does not fit directly 
into the theory presented up to now (so it is, e.g., not clear what asymptotic 
freeness between Xm,n and T/v should mean). However, rectangular matrices 
can be treated in free probability by either embedding them into bigger square 
matrices and applying some compressions at appropriate stages or, more di- 
rectly, by using a generalization of free probability due to Benaych-Georges 
[Ben09| which is tailor-made to deal with rectangular random matrices. 

22.6 Second Order Freeness 

Asymptotic freeness of random matrices shows that the mixed moments of two 
ensembles in generic position are deterministically calculable from the moments 
of each individual ensemble. The formula for the calculation of the mixed mo- 
ments is the essence of the concept of freeness. The same philosophy applies 
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also to finer questions about random matrices, most notably to global fluctua- 
tions of linear statistics. With this we mean the following: for many examples 
(like Gaussian or Wishart) of random matrices the magnified fluctuations 
of traces around the limiting value, N(ti(A l ^ f ) — liniM^oo tr(A^)) , form asymp- 
totically a Gaussian family. If we have two such ensembles in generic position 
(e.g., if they are independent and one of them is unitarily invariant), then this 
is also true for mixed traces and the covariance of mixed traces is determined 
in a deterministic way by the covariances for each of the two ensembles sepa- 
rately. The formula for the calculation of the mixed covariances constitutes the 
definition of the concept of second order freeness. There exist again cumulants 
and an i?-transform on this level, which allow explicit calculations. For more 
details, see |Gol071 iMmTU] . 

22.7 Operator- Valued Free Probability Theory 

There exists a generalization of free probability theory to an operator- valued 
level, where the complex numbers C and the expectation state (p : A — > C are 
replaced by an arbitrary algebra B and a conditional expectation E : A —* B. 
The formal structure of the theory is very much the same as in the scalar- 
valued case, one only has to take care of the fact that the "scalars" from B do 
not commute with the random variables. 

Definition 22.7.1 1 ) Let Abe a unital algebra and consider a unital subalgebra 
B C A. A linear map E : A — > B is a conditional expectation if E[b] = b for all 
b £ B and E\b\db2\ = b\E[a\b2 for all a £ A and all 61,62 £ B. An operator- 
valued probability space consists of B C A and a conditional expectation E : 
A^B. 

2) Consider an operator-valued probability space B C A, E : A — > B. Ran- 
dom variables (xj)i e / C A are free with respect to E or free with amalgamation 
over B if -Efpi^jm) • ■ ■ Pfc(»j(fc))] = 0, whenever k £ N, pj are elements from 
the algebra generated by B and neighboring elements are different, i.e., 

i(l) 7^ i{2) 7^ • • • 7^ i(k), and we have E\pj{xi^\\ = for all j = 1, . . . , k. 

Voiculescu introduced this operator- valued version of free probability theory 
in [Voi85] and provided in [Voi95j also a corresponding version of free convolu- 
tion and 7£-transform. A combinatorial treatment was given by Speicher |Spe98| 
who showed that the theory of free cumulants has also a nice counterpart in 
the operator-valued frame. 

For a £ A we define its (operator-valued) Cauchy transform G a : B — » B by 

G a (b) : =E[-^-] = ^E[b- 1 (ab- 1 n 
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The operator- valued IZ-transform of a, lZ a : B — > ,8, can be defined as a power 
series in operator-valued free cumulants, or equivalently by the relation bG(b) = 
1 + H(G(b)) ■ G{b) or G(b) = (b - ft(G(&))) -1 . One has then as before: If x 
and y are free over B, then 1Z x+y {b) = TZ x (b) + !Z y (b). Another form of this is 
the subordination property G x +y{b) = G x [b — 1Z y (G x + y (b))~\ ■ 

There exists also the notion of a semicircular element s in the operator- 
valued world. It is characterized by the fact that only its second order free 
cumulants are different from zero, or equivalently that its 7£-transform is of the 
form 1Z s (b) = rj(b), where rj : B — * B is the linear map given by rj(b) = E[sbs]. 
Note that in this case the equation for the Cauchy transform reduces to 

bG(b) = l + rj[G(b)]-G(b); (22.7.1) 

more generally, if we add aniGB, for which we have G x (b) = E[(b — x) _1 ] = 
(b — x) _1 , we have for the Cauchy transform of x + s the implicit equation 

G x+S {b) = G x [b-K S (G X+S (b))] = {b - V [G X+S (b)} -x)-\ (22.7.2) 

It was observed by Shlyakhtenko [Shl96] that operator- valued free probabil- 
ity theory provides the right frame for dealing with more general kind of random 
matrices. In particular, he showed that so-called band matrices become asymp- 
totically operator-valued semicircular elements over the limit of the diagonal 
matrices. 

Theorem 22.7.2 Suppose that An = A* N is an N x N random band matrix, 
i.e., A^ = (<Hj)ij—i, where {aij \ i < j} are centered independent complex 



i r\l o t? inn-fh H I / , - - , , , , , , , 

Jl r- TOOftn H2l 



Gaussian random variables, with E[aijdij] = (1 + dija 2 (i/N, j /N))/N for some 



a £ L°°([0, 1] ). Let Bn be the diagonal N x N matrices, and embed Bn into 
B := L°°[0, 1] as step functions. Let Bn S Bm be selfadjoint diagonal matrices 
such that Bjy — ► / G L°°[0, 1] in \\ • H^. Then the limit distribution of Bn + An 
exists, and its Cauchy transform G is given by 

G ( z ) = / g(z,x)dx, 
Jo 



where g(z, x) is analytic in z and satisfies 



g{z,x) 



f{x) - I a 2 (y,x)g(z,y)dy 



o 



(22.7.3) 



Note that (|22,7.3|) is nothing but the general equation (|22.7.2|) specified to 
the situation B = L°°[0, 1] and i] : L°°[0, 1] — > L°°[0, 1] acting as integration 
operator with kernel a 2 . 

Moreover, Gaussian random matrices with a certain degree of correlation 
between the entries are also asymptotically semicircular elements over an ap- 
propriate subalgebra, see [Ras08j. 
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22.8 Further Free-Probabilistic Aspects of Random 
Matrices 

Free probability theory provides also new ideas and techniques for investigating 
other aspects of random multi-matrix models. In particular, Haagerup and 
Thorbjornsen [HaaQ2], [Haa05j obtained a generalization to several matrices for 
a number of results concerning the largest eigenvalue of a Gaussian random 
matrix. 

Much work is also devoted to deriving rigorous results about the large TV" 
limit of random multi-matrix models given by densities of the type 

c N e- NHrP ^- A ^dX(A 1 ,...,A n ), 

where dX is Lebesgue measure, A±, . . . , A n are selfadjoint N x N matrices, and 
P a noncommutative selfadjoint polynomial. To prove the existence of that 
limit in sufficient generality is one of the big problems. For a mathematical 
rigorous treatment of such questions, see [Gui06j . 

Free Brownian motion is the large N limit of the Dyson Brownian motion 
model of random matrices (with independent Brownian motions as entries, 
compare Chapter 11). Free Brownian motion can be realized concretely in 
terms of creation and annihilation operators on a full Fock space (see section 
122.9. ip . There exists also a corresponding free stochastic calculus [Bia98j; for 
applications of this to multi- matrix models, see [Gui07] . 

There is also a surprising connection with the representation theory of the 
symmetric groups S n . For large n, representations of S n are given by large 
matrices which behave in some respects like random matrices. This was made 
precise by Biane who showed that many operations on representations of the 
symmetric group can asymptotically be described by operations from free prob- 
ability theory, see [Bia02] . 



22.9 Operator Algebraic Aspects of Free Probability 

A survey on free probability without the mentioning of at least some of its 
operator algebraic aspects would be quite unbalanced and misleading. We 
will highlight some of these operator algebraic facets in this last section. For 
the sake of brevity, we will omit the definitions of standard concepts from 
operator algebras, since these can be found elsewhere (we refer the reader to 
[Voi92, Voi05, HiaOOj for more information on notions, as well as for references 
related to the following topics). 
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22.9.1 Operator Algebraic Models for Freeness 
Free group factors 

Let G = *i e iGi be the free product of groups G«. Let L(G) denote the group 
von Neumann algebra of G, and ip the associated trace state, corresponding 
to the neutral element of the group. Then L(Gi) can be identified with a 
subalgebra of L{G) and, with respect to ip, these subalgebras (L(Gi))i£i are 
free. This freeness is nothing but the rewriting in terms of ip what it means 
that the groups Gi are free as subgroups in G. The definition of freeness was 
modeled according to the situation occurring in this example. The free in free 
probability theory refers to this fact. 

A special and most prominent case of these von Neumann algebras are the 
free group factors L(¥ n ), where F„ is the free group on n generators. One 
hopes to eventually be able to resolve the isomorphism problem: whether the 
free groups factors L(¥ n ) and L(¥ m ) are, for n,m>2, isomorphic or not. 

Creation and annihilation operators on full Fock spaces 

Let Ti be a Hilbert space. The full Fock space over Ti is defined as J-(Ti) := 
©r=o^ 8r1, The summand Ti®° on the right-hand side of the last equation is 
a one-dimensional Hilbert space. It is customary to write it in the form Cfi 
for a distinguished vector of norm one, which is called the vacuum vector. The 
vector state on B(T(Ti)) given by the vacuum vector, th(T) := (TQ,Q) 
(T 6 B(T(Ti))), is called vacuum expectation state. 

For each ( £ H, the operator Z(£) £ B(J-(Ti)) determined by the formula 
= £ and ■ ■ - ®£n = £ <S) <S) ■ ■ ■ <S) £ n for all n > 1, ...,£„€ W, is 

called the (left) creation operator given by the vector £. As one can easily verify, 
the adjoint of Z(£) is described by the formula: = 0, Z(£)*£i = (6,0^' 

and Z(£)*£i <S> • • • <S> £ n = ® • • • <8> £n and is called the (7e/£j annihilation 

operator given by the vector £. 

The relevance of these operators comes from the fact that orthogonality 
of vectors translates into free independence of the corresponding creation and 
annihilation operators. 

Proposition 22.9.1 Let H be a Hilbert space and consider the probability space 
(B^iTi.)),^). Let Tii, ■ ■ ■ ,7~Ck be a family of linear subspaces ofTi, such that 
Tii _L Tij for i ^ j (1 < i,j < k). For every 1 < i < k let A% be the unital 
C* -subalgebra of B{T(J-L)) generated by {1(C) '■ £ £ Hi}- Then Ai, ■ ■ ■ ,Ak are 
freely independent in (B^iTi)),^). 

Also semicircular elements show up very canonically in this frame; namely, 
if we put I := Z(£) for a unit vector then l + l* is a semicircular element of 

variance 1. More generally, one has that Z + /(Z*) has 7£-transform TZ(z) = f(z) 
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(for / a polynomial, say). This, together with the above proposition, was the 
basis of Voiculescu's proof of Theorem 122.5.31 Similarly, a canonical realization 
for the S-transform is (1 + l)g(l*), for which one has S(z) = l/g(z). This 
representation is due to Haagerup who used it for a proof of Theorem 122.5.51 

22.9.2 Free Entropy 

Free entropy is, as the name suggests, the counterpart of entropy in free proba- 
bility theory. The development of this concept is at present far from complete. 
The current state of affairs is that there are two distinct approaches to free 
entropy. These should give isomorphic theories, but at present we only know 
that they coincide in a limited number of situations. The first approach to a 
theory of free entropy is via microstates. This goes back to the statistical me- 
chanics roots of entropy via the Boltzmann formula and is related to the theory 
of large deviations. The second approach is microstates free. This draws its 
inspiration from the statistical approach to classical entropy via the notion of 
Fisher information. We will in the following only consider the first approach 
via microstates, as this relates directly with random matrix questions. 

Wigner's semicircle law states that as ./V — > oo the empirical eigenvalue 
distribution ha n of an N x N Gaussian random matrix An converges almost 
surely to the semicircular distribution fiyy, i.e., the probability that ha n is in 
any fixed neighborhood of the semicircle converges to 1. We are now interested 
in the deviations from this: What is the rate of decay of the probability that 
HA N is close to v, where v is an arbitrary probability measure? We expect that 
this probability behaves like e~ N2j ^ , for some rate function I vanishing at the 
semicircle distribution. By analogy with the classical theory of large deviations, 
I should correspond to a suitable notion of free entropy. This heuristics led 
Voiculescu to define in [Voi93] the free entropy x m the case of one variable to 
be 

X (u) = JJ]ag\8- t\dv(s)dv{t) + ~ + ~ log 2vr. 

Inspired by this, Ben-Arous and Guionnet proved in |Ben97j a rigorous version 
of a large deviation for Wigner's semicircle law, where the rate function I(y) 
is, up to a constant, given by — x( v ) + \ f t 2 dv{t). 

Consider now the case of several matrices. By Voiculescu's generalization 
of Wigner's theorem we know that n independent Gaussian random matrices 
A^j\ . . . , A^ converge almost surely to a freely independent family s±, . . . , s n of 
semicircular elements. Similarly as for the case of one matrix, large deviations 
from this limit should be given by 

Prob {(A$, . ..,A%>): distr((A$, . ...ij^ distr( ai , . . . ,a n )} ~ e -^(^-.M 
where I(a\, . . . , a n ) should be related to the free entropy of the random variables 
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oi, . . . , a n . Since the distribution distr(ai, . . . , a n ) of several non-commuting 
random variables ai, . . . , a n is a mostly combinatorial object (consisting of the 
collection of all joint moments of these variables), it is much harder to deal with 
these questions and, in particular, to get an analytic formula for /. Essentially, 
the above heuristics led Voiculescu to the following definition [Voi94b| of a free 
entropy for several variables. 

Definition 22.9.2 Given a tracial W* -probability space (M, r) (i.e., M a von 
Neumann algebra and r a faithful and normal trace), and an n-tuple (oi, . . . , a n ) 
of self adjoint elements in M, put 

r(ai, . . . ,a n ;N,r,e) : = 

{{A 1 ,...,A n ) E M N (C) n sa : \ti(A il ...A ik )-T{a h ...a ik )\ < e 

for all 1 < i±, . . . ,ik < n, 1 < k < r\ 

In words, T(a\, . . . ,a n ; N,r,e) is the set of all n-tuples of N x N selfadjoint 
matrices which approximate the mixed moments of the selfadjoint elements 
ai,...,a n of length at most r to within e. 

Let A denote Lebesgue measure on M/v(C)™ a . Define 

X(ai, . . .,a n ;r,e) := limsup — 5 log A(T(a 1 , . . .,a n ;N,r,e) + -logiV, 

N->oo ^ * 

and 

X(ai, ...,a n ):= lim^x^l,-- • ,a n ;r,e). 
The function \ is called the free entropy. 



Many of the expected properties of this quantity x have been established 
(in particular, it behaves additive with respect to free independence), and there 
have been striking applications to the solution of some old operator algebra 
problems. A celebrated application of free entropy was Voiculescu's proof of 
the fact that free group factors do not have Cartan subalgebras (thus settling a 
longstanding open question). This was followed by several results of the same 
nature; in particular, Ge showed that L(¥ n ) cannot be written as a tensor prod- 
uct of two Hi factors. The rough idea of proving the absence of some property 
for the von Neumann algebra L(F n ) using free entropy is the following string of 
arguments: finite matrices approximating in distribution any set of generators 
of L(¥ n ) should also show an approximate version of the considered property; 
one then has to show that there are not many finite matrices with this approx- 
imate property; but for L(¥ n ) one has many matrices, given by independent 
Gaussian random matrices, which approximate its canonical generators. 
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However, many important problems pertaining to free entropy remain open. 
In particular, we only have partial results concerning the relation to large de- 
viations for several Gaussian random matrices. For more information on those 
and other aspects of free entropy we refer to [Voi 02| IBia03|. IGui04] . 

22.9.3 Other Operator Algebraic Applications of Free Proba- 
bility Theory 

The fact that freeness occurs for von Neumann algebras as well as for random 
matrices means that the former can be modeled asymptotically by the latter and 
this insight resulted in the first progress on the free group factors since Murray 
and von Neumann. In particular, Voiculescu showed that a compression of some 
L(¥ n ) results in another free group factor; more precisely, one has (L(¥ n ))u m = 
-^(F 1+m 2( ra _ 1 )). By introducing interpolated free group factors L(¥ t ) for all 
real t > 1, this formula could be extended by Dykema and Radulescu to any 
real n,m > 1, resulting in the following dichotomy: One has that either all 
free group factors L(¥ n ) n > 2 are isomorphic or that they are pairwise not 
isomorphic. 

There exist also type III versions of the free group factors; these free ana- 
logues of the Araki- Woods factors were introduced and largely classified by 
Shlyakhtenko. 

The study of free group factors via free probability techniques has also 
had an important application to subfactor theory. Not every set of data for a 
subfactor inclusion can be realized in the hyperfinite factor; however, work of 
Shlyakhtenko, Ueda, and Popa has shown that this is possible using free group 
factors. 

By relying on free probability techniques and ideas, Haagerup achieved also 
a crucial break-through on the famous invariant subspace problem: every op- 
erator in a II\ factor whose Brown measure (which is a generalization of the 
spectral measure composed with the trace to non-normal operators) is not con- 
centrated in one point has non-trivial closed invariant subspaces affiliated with 
the factor. 
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